Abstract

We consider distributed algorithms for data aggregation and function computation in sensor
networks. The algorithms perform pairwise computations along edges of an underlying
communication graph. A token is associated with each sensor node, which acts as a transmission permit.
Nodes with active tokens have transmission permits; they generate messages at a constant rate
and send each message to a randomly selected neighbor. By using different strategies to control
the transmission permits we can obtain tradeoffs between message and time complexity. Gossip
corresponds to the case when all nodes have permits all the time. We study algorithms where
permits are revoked after transmission and restored upon reception. Examples of such
algorithms include Simple Random Walk (SRW), Coalescent Random Walk (CRW), and Controlled
Flooding (CFLD), and their hybrid variants. SRW has a single permit, which is passed from node to node
in the network. CRW initially has a permit for each node, but these permits are revoked
gradually. The final result for SRW and CRW resides at a single (or a few) random node(s), making
a direct comparison with GOSSIP difficult. A hybrid two-phase algorithm switching from CRW
to CFLD at a suitable predetermined time can be employed to achieve consensus. We show
that such hybrid variants achieve significant gains in both message and time complexity. The
per-node message complexity for n-node graphs, such as 2D meshes, tori, and random geometric
graphs, scales as $O(\mathrm{polylog}(n))$ and the corresponding time complexity scales as $O(n)$. The
reduced per-node message complexity leads to reduced energy utilization in sensor networks.



1 Introduction 

Large sensor systems have remarkable potential in a wide range of applications from environmen- 
tal monitoring to intrusion detection. Such systems are now viable thanks to recent progress in 
integration and communication technologies, yet their size precludes classical telemetry to collect 
sensory data and poses algorithmic challenges in handling the high data volume. The principle of 
gossip offers an appealing and scalable solution approach to this issue: in broad terms, gossip
algorithms are decentralized methods to compute statistics of system-wide data based on asynchronous
message passing between sensors that are within immediate communication range. The purpose of
this paper is to introduce and analyze token-based gossip algorithms, and put them in perspective
with previously studied gossip algorithms with respect to computation time, energy consumption,
accuracy, and robustness.

*This research was supported by NSF Grant 0932114 and NSF CAREER awards ECS-0449194 and CNS-0238397.

The generic gossip algorithm has been well studied in the context of computing averages [7, 15,
22], and has roots in load balancing (see, for example, [5, Section 7.4]). Here it is assumed that
each sensor has a local scalar value and it is of interest to compute the average of these values. 
Gossip algorithms accomplish this task by randomly choosing two neighboring sensor nodes at each 
time and replacing their current values by their average. It turns out that under mild conditions 
this process over time converges to the average of all sensor values at all the sensors, i.e., the 
sensors asymptotically achieve a consensus. The algorithm executes autonomously at each sensor; 
it is robust to communication errors and sensor failures; and a final state of consensus provides 
robustness and convenience in reading the average from the system. Such consensus algorithms 
have been recently explored in other contexts such as detection [1,20]. 

Nevertheless, these algorithms have fundamental disadvantages from an energy efficiency perspec- 
tive. For example it is well known that in grids and in tori with n nodes, the number of message 
transmissions per node to complete the computational task scales as $\Omega(n)$ [7, 21], which can be
significant for a large sensor network. The fundamental reason is that energy efficiency resulting from
in-network processing is offset by ad-hoc message passing that results in redundant computations, 
i.e., the same set (or largely similar set) of nodes repeatedly fuse their information at different 
points in time. In a related problem involving distributed detection, the significant energy scaling 
can be attributed to the loopy nature of the network where messages sent from one node repeatedly 
arrive at the node at different points in time. In order to ensure that no information from any node 
is forgotten, each node must re-inject its value into the network to reinforce its information [20]. At 
a fundamental level the significant scaling of energy arises due to the slow mixing rate of large net- 
works, which can be attributed to rather large second eigenvalues of certain connectivity matrices 
associated with the underlying communication graph [7, Theorem 3].

Another important disadvantage of generic gossip algorithms concerns accuracy. The stopping criterion
of gossip is based on tail probabilities of a normalized distance between the current system state and
the state of consensus [7]. In turn, these algorithms provide only probabilistic guarantees on final
consensus and therefore they should be considered as approximation algorithms. It should also be 
noted that even when such a guarantee holds and the final normalized error happens to be small, 
the actual error may be substantial if the normalizing constant is large. Such situations arise in 
large networks in which a small set of sensor values significantly influence the sought value. 

To put the present paper in perspective with gossip algorithms it helps to consider signal processing 
and communications separately. Here we adopt the objective of exact computation of a function of 
distributed data, and consider performance of a novel class of communication algorithms towards 
that end. In this view conventional gossip may be considered as a communication algorithm to
approximate the same function. This approximation is clearly one specific instantiation, and there is
a wide range of other approximation criteria that may be useful to consider from a signal processing
perspective. From a signal processing point of view it also makes sense to incorporate prior signal
information, such as boundedness, distribution of values, etc. While these issues are important, our
focus here is primarily on the communication aspects for exact distributed computation.

1.1 Token-Based Algorithms 

We present a novel concept of token-based gossip for distributed computing. This concept preserves 
the ad-hoc network operation but entails perfect accuracy of computation and exponential savings 
in energy-consumption over the existing local message passing algorithms. Under the algorithms 
studied here, a transmitting node becomes inactive and does not transmit further messages until 
it is reactivated by a message reception from another node. An active node generates messages 
at constant rate and sends each message to a randomly selected neighbor. Hence network nodes 
implement interactive sleep-wake schedules. Active nodes are interpreted to hold imaginary tokens
that act as transmission permits. The total energy consumption is controlled by managing the 
number of tokens in the network. 

We describe different instances of token based algorithms: (i) Algorithm SRW maintains a single 
active node (i.e. a single token), whose trajectory is a random walk on the communication graph. 
Local processing at each active node exploits a decomposability property of the considered function 
to guarantee that the function is computed when each node becomes active at least once. In turn 
performance of this algorithm is closely related to the cover time [24] of random walks. (ii) Under
algorithm CRW all nodes are initially active but when two active nodes communicate with each 
other their tokens coalesce and therefore the number of tokens in the system reduces by one. The 
computation is completed when a single token remains in the system. This latter algorithm is closely 
related with coalescing random walks [8] that have been studied as duals of a class of interacting 
particle systems known as voter models. The two algorithms are illustrated in Figures 1(a)-(b).

A range of distributed algorithms can be obtained by variations of the token-based communication 
concept. For example, conventional gossip can be considered as a token-based algorithm where each 
node maintains a token at all times. The local processing upon each message exchange in this case 
is illustrated in Figure 2(a). Alternatively, one may consider hybrid schemes to improve message 
and time complexity. One hybrid scheme involves a fixed but arbitrary number of tokens.
Tokens progress from an active node to an inactive node while updating local values exactly as in 
SRW. If two active nodes interact then they both relax their values as in conventional gossip, and 
remain active. An illustration of such a scheme is given in Figure 2(b). Another hybrid scheme 
involves switching at some time t, from CRW to Controlled Flooding (CFLD) [6], which is a token 
based algorithm where tokens multiply at each local broadcast. In contrast to flooding, CFLD 
follows additional rules to control the number of transmissions (see Section 5).

Figure 1: Illustration of token-based algorithms: (a) SRW and (b) CRW. Each node is either active or
inactive. Active nodes are indicated with light color. Nodes can transmit information in the active state but
not in the inactive state. A node can transition from active to inactive or vice versa based on a pre-defined
protocol. In SRW there is always one active node. In CRW every node is active initially but the number of
active nodes decreases in time, towards the final value one.

Figure 2: Illustration of two algorithms in the framework of the paper. (a) In conventional gossip all nodes
are active all the time. Transmission is two-way and two nodes that share information replace their prior
values with the fused values. (b) Hybrid token algorithm that maintains a constant (in this case 2) number
of active nodes. Transactions between an active and an inactive node are governed by the rules of SRW,
while transactions between two active nodes are governed by rules of conventional gossip.

1.2 Time and Message Complexities



In this section we describe tradeoffs between time and message complexities for regular graphs 
to illustrate some of the benefits of SRW and CRW. However, a fundamental difference between 
Gossip algorithm and SRW/CRW makes this comparison potentially difficult. Note that in the 
standard Gossip algorithm, the fused value is a consensus estimate at all the nodes. In contrast 
SRW/CRW realize the fused value at a random node in the network. In practice it is desirable 
to have access to the fused value at a designated node(s). Consequently, to meet this requirement, 
SRW/CRW would have to transmit the fused estimate to the designated fusion center. This raises 
two fundamental issues: 

(A) How can a node recognize that it has the fused estimate? 

(B) How to efficiently transmit the fused information to the designated fusion center? 

As it turns out, both of these questions can be addressed satisfactorily. To address the first 
issue we augment the distributed computation problem with a secondary distributed computation 
procedure. This secondary computation determines when fusion has been realized. To transmit 
this information to the designated node(s) we flood the entire network through CFLD. The overall 
message complexity for CFLD for a single message scales as the number of communication links in 
the network. The time complexity scales as the diameter of the network [14]. Furthermore, other 
choices can result in superior performance. For instance, nodes follow the CRW protocol until a 
predesignated time t and switch to CFLD after time t. 

Table 1 (see Sections 4 and 6) illustrates completion times and per-node transmission counts for
regular topologies. For the d-dimensional lattice torus with n sensors, the completion time of
CRW is $\Theta(n (\log n)^{\alpha})$ and the energy requirement per sensor node is $O((\log n)^{\alpha+1})$, where $\alpha = 1$ for
$d = 2$ and $\alpha = 0$ for $d \geq 3$. The algorithm thus has a favorable energy scaling; furthermore its
performance is almost insensitive to changes in the network connectivity represented by different
values of $d \geq 2$, hinting at the possibility of predictable performance over mesh topologies. Both
the time and the per-node energy requirement of the generic gossip algorithm of [7] scale as $\Omega(n)$
on the 2-dimensional torus with n nodes. We also depict results for the two phase CRW + CFLD 
algorithm. For the 2D torus the time complexity is similar to GOSSIP and the message complexity 
is similar to CRW. Therefore, this two-phase scheme is an improvement over both GOSSIP and 
CRW. Recent results indicate that the energy scaling can be improved significantly via variations 
of gossip that require location awareness capability for each sensor [11, 16]. Similar variations of 
token-based gossip may also yield reduction in energy complexity, though that direction is not 
pursued in this paper. 

The conclusions of Table 1 (see Sections 4 and 6) for regular topologies are presented in Section 3.2
and are largely based on existing results in the applied probability literature. Preliminary work along
these lines has also been described by the authors [21]. This paper develops results for general
topologies based on a deeper analysis of hitting-time computations on the communication graph.
While our approach can lead to conservative estimates for general graphs, it turns out that for
graphs with local neighborhood structure, such as grids and random geometric graphs (RGG), these
estimates are relatively tighter. Our message complexity for RGG scales as $O(\mathrm{polylog}(n))$ and can
be combined with CFLD to realize consensus with significant improvement in message and time
complexity over GOSSIP.

Table 1: (a) Message and (b) time complexities with n nodes.

Robustness: Similar to other token-based communication algorithms such as token rings, robust- 
ness of the introduced algorithms suffers from the potential of losing tokens. Packet losses do not 
contribute to this potential if reliable link protocols are adopted. However permanent node failures 
may have an impact on system performance if a node fails while holding a token. The issue may be 
mitigated by running multiple independent instances of a token-based algorithm simultaneously, in 
order to reduce the likelihood of losing all tokens simultaneously at a failing node, without altering 
the scaling of time and message complexities. As will be clear in the sequel, a token-bearing node 
knows explicitly the number of sensor values fused in its current value; and that value may be useful 
partial information in case of node failures. Alternatively a hybrid scheme with multiple tokens 
may be invoked, thereby providing a robustness akin to conventional gossip. 

Paper Organization: The paper is organized as follows. In Section 2 we formalize a general dis- 
tributed computation problem that includes computation of statistics of spatially dispersed sensory 
data. The two token-based gossip algorithms SRW and CRW are formally specified in Section 3 and 
their correctness is established. Section 3.2 illustrates time and message complexities for regular
topologies as summarized by Table 1. Section 5 describes two-phase algorithms that combine CRW
with CFLD to realize a consensus fused estimate. Section 6 gives a novel analysis of coalescing 
random walks on arbitrary graphs and thereby determines the complexity of CRW in the general 
setting. 

2 Problem formulation



We consider a collection of n networked sensors. The communication graph of this collection is 
an undirected graph $G = (V, E)$ in which each sensor is uniquely represented by a node in $V$. An
edge in E indicates that the two sensors corresponding to its incident nodes have a bidirectional 
communication link. In order to avoid trivialities G is assumed to be connected; otherwise it is 
arbitrary. 

Each sensor $i$ has a value $x_i$ and we are interested in computing a function $F_n(x_1, x_2, \ldots, x_n)$ of
the $n$ sensor values. The function $F_n(\cdot)$ is assumed to be symmetric so that for any permutation $\pi$
of $\{1, 2, \ldots, n\}$

$$F_n(x_1, x_2, \ldots, x_n) = F_n(x_{\pi_1}, x_{\pi_2}, \ldots, x_{\pi_n}).$$

Furthermore we assume existence of an atomic function $f(\cdot)$ such that for each $1 \leq k < n$

$$F_n(x_1, x_2, \ldots, x_n) = f\big(F_k(x_{\pi_1}, \ldots, x_{\pi_k}),\; F_{n-k}(x_{\pi_{k+1}}, x_{\pi_{k+2}}, \ldots, x_{\pi_n})\big). \qquad (1)$$

In particular

$$f(x_i, x_j) = F_2(x_i, x_j). \qquad (2)$$

For example if $f(x_i, x_j) = \max(x_i, x_j)$ or if $f(x_i, x_j) = x_i + x_j$ then $F_n(\cdot)$ is respectively the
maximum or the sum of $x_1, x_2, \ldots, x_n$. If each $x_i = (y_i, w_i)$ is a vector and $f(\cdot)$ is the vector-valued
function

$$f(x_i, x_j) = \big((w_i + w_j)^{-1}(w_i y_i + w_j y_j),\; w_i + w_j\big),$$

then $F_n(\cdot)$ is the tuple

$$\Big(\big(\textstyle\sum_{i=1}^{n} w_i\big)^{-1} \textstyle\sum_{i=1}^{n} w_i y_i,\; \textstyle\sum_{i=1}^{n} w_i\Big).$$

Weighted average computations of this sort are particularly relevant to applications of Kalman 
filtering in distributed tracking [19]. 

We finally assume existence of a special value $e$ that acts as an identity element for $f(\cdot)$ so that for
any value $x$

$$f(x, e) = f(e, x) = x \quad \text{and} \quad f(e, e) = e.$$

This element is not a fundamental requirement, and in fact it makes the upcoming algorithm spec- 
ifications look somewhat mysterious, but it is useful in giving a concise reasoning about correctness 
of the algorithms introduced next. 
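The atomic-function formulation above can be illustrated with a minimal Python sketch. It is only an illustration under stated assumptions: the names `IDENTITY`, `with_identity`, `f_max`, `f_sum`, `f_wavg`, and `F` are hypothetical, and the identity element $e$ is modeled by the sentinel `None` rather than by a numeric value.

```python
from functools import reduce

# Hypothetical sentinel standing in for the identity element e.
IDENTITY = None


def with_identity(f):
    """Extend a pairwise function f so that f(x, e) = f(e, x) = x and f(e, e) = e."""
    def g(a, b):
        if a is IDENTITY:
            return b
        if b is IDENTITY:
            return a
        return f(a, b)
    return g


# Atomic functions from the examples in the text: maximum and sum.
f_max = with_identity(max)
f_sum = with_identity(lambda a, b: a + b)


def _weighted_average(a, b):
    # Each value is a pair (y, w); the result carries the weighted mean and total weight.
    (ya, wa), (yb, wb) = a, b
    return ((wa * ya + wb * yb) / (wa + wb), wa + wb)


f_wavg = with_identity(_weighted_average)


def F(f, values):
    """Compute F_n by folding the atomic function f over the values in any order."""
    return reduce(f, values, IDENTITY)
```

For instance, `F(f_wavg, [(1.0, 1.0), (3.0, 3.0)])` fuses two weighted readings into a single pair holding the weighted mean and the total weight.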

3 Token-based Gossip Algorithms 

We specify two algorithms, namely SRW and CRW. A pseudo-code for these algorithms is given in 
Figure 1. Under each algorithm, each node maintains three variables value, status, and count. 

Algorithm 1 Pseudo-code for algorithms SRW and CRW at node i. Send() is activated by the local
Poisson clock at the node, and Receive() is activated by message reception from some other node. The two
algorithms differ in the initialization of the variable status. Each algorithm terminates when count is equal
to the number of nodes in the system.

Variables: status, value, count.

Initialize: value ← x_i; count ← 1;
    SRW : status ← 'active' if i = i_0, 'inactive' else.
    CRW : status ← 'active'.

Procedure Send() {
    if( status == 'active' ) {
        choose neighbor;
        send(neighbor, value, count);
        value ← e;
        count ← 0;
        status ← 'inactive';
    }
}

Procedure Receive(value_in, count_in) {
    value ← f(value, value_in);
    count ← count + count_in;
    status ← 'active';
}



The content of value is an estimate of $F_n(x_1, x_2, \ldots, x_n)$. Initially value $= x_i$ and count $= 1$ at each
node $i$. The variable status is either 'active' or 'inactive' and it indicates whether the node is
holding a token or not. Let

$$v_i(t) = \text{content of value at node } i \text{ at time } t,$$
$$\xi_i(t) = \begin{cases} 1 & \text{if status of node } i \text{ is 'active' at time } t \\ 0 & \text{else,} \end{cases}$$
$$c_i(t) = \text{content of count at node } i \text{ at time } t.$$

Hence $v_i(0) = x_i$ and $c_i(0) = 1$ for each node $i$. The initial value $\xi_i(0)$ (i.e. of status) depends on
the particular algorithm: Under SRW $\xi_i(0) = 1$ for exactly one node, say node $i_0$, whereas under
CRW $\xi_i(0) = 1$ for all nodes $i$.

Variables value, status, and count evolve according to the same rules under both algorithms: Namely, each
node has an independent Poisson clock that ticks at unit rate. When the local clock of a node
ticks, the node does not take any action unless it is active at that time. If the node is active, then
it chooses a neighbor at random, sends its current value and count to that neighbor, and becomes
inactive. The Send() subroutine of the algorithm ensures that value is the identity $e$ and count is 0
at each node that has turned inactive after a transmission. If the selected neighbor was inactive at the time of
reception, then it simply adopts the variables of the sender and it activates itself. Otherwise, the
node executes a step towards computation of $F_n(\cdot)$ by applying $f(\cdot)$ and adds the received count value to its own.

For each time $t$ define $\xi(t) = (\xi_1(t), \xi_2(t), \ldots, \xi_n(t))$. The process $(\xi(t) : t \geq 0)$ of activity
indicators is Markovian due to the randomness of the choice of neighbor by active nodes: For SRW
the unique 1 in $(\xi_1(t), \xi_2(t), \ldots, \xi_n(t))$ follows a simple random walk on the communication graph.
For CRW $(\xi_1(t), \xi_2(t), \ldots, \xi_n(t))$ indicates sites occupied by $n$ simple random walks each of which
evolves independently until it meets another and makes identically the same transitions with that
walk afterwards.
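The Send()/Receive() rules described above can be sketched as a small event-driven simulation. This is a hedged illustration, not the authors' implementation: the function names `ring` and `run_token_gossip` and their parameters are assumptions introduced here, the graph is given as adjacency lists without self-loops, and the superposition of the unit-rate Poisson clocks of the active nodes is simulated directly (next event after an exponential time with rate equal to the number of active nodes, sender chosen uniformly among them).

```python
import random


def ring(n):
    """Adjacency lists of an n-node ring (hypothetical helper for the example)."""
    return [[(i - 1) % n, (i + 1) % n] for i in range(n)]


def run_token_gossip(adj, x, f, e, mode="CRW", start=0, seed=None):
    """Simulate SRW or CRW until some node holds count == n.

    adj: adjacency lists; x: initial sensor values; f: atomic function;
    e: identity element of f; mode: "SRW" (single token at `start`) or "CRW".
    Returns (fused value, completion time, number of transmitted messages).
    """
    rng = random.Random(seed)
    n = len(adj)
    value, count = list(x), [1] * n
    status = [mode == "CRW" or i == start for i in range(n)]
    active = [i for i in range(n) if status[i]]
    t, messages = 0.0, 0
    holder = next((i for i in range(n) if count[i] == n), None)
    while holder is None:
        # Only active nodes act, each at unit rate.
        t += rng.expovariate(len(active))
        k = rng.randrange(len(active))
        sender = active[k]
        receiver = rng.choice(adj[sender])
        messages += 1
        # Send(): hand over value and count, reset to identity, become inactive.
        v_in, c_in = value[sender], count[sender]
        value[sender], count[sender], status[sender] = e, 0, False
        active[k] = active[-1]
        active.pop()
        # Receive(): fuse the incoming value, accumulate count, become active.
        value[receiver] = f(value[receiver], v_in)
        count[receiver] += c_in
        if not status[receiver]:
            status[receiver] = True
            active.append(receiver)
        if count[receiver] == n:
            holder = receiver
    return value[holder], t, messages
```

A possible usage is `run_token_gossip(ring(8), [3, 1, 4, 1, 5, 9, 2, 6], lambda a, b: a + b, 0, mode="SRW", seed=7)`, where 0 plays the role of the identity element for the sum.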

3.1 Algorithm Correctness 

We first establish correctness of the introduced algorithms by showing that each algorithm computes
the quantity $F_n(x_1, x_2, \ldots, x_n)$ in finite time.

Lemma 3.1 Under both SRW and CRW, and for all $t \geq 0$,

$$F_n(v_1(t), v_2(t), \ldots, v_n(t)) = F_n(x_1, x_2, \ldots, x_n).$$

Proof. We prove the lemma by induction. Let $t_0 = 0$ and let $t_k$ be the time of the $k$th message
passing in the system. The claim is true at time $t_0$ due to the initialization of each algorithm.
Since the system state remains constant in the interval $[t_k, t_{k+1})$ it is enough to show the claim
holds at time $t_{k+1}$ provided that it holds at time $t_k$. To this end suppose the claim holds at time
$t_k$. Without loss of generality, suppose that the $(k+1)$st message has sender 1 and receiver 2. Then
$v_1(t_{k+1}) = e$, $v_2(t_{k+1}) = f(v_1(t_k), v_2(t_k))$ and $v_l(t_{k+1}) = v_l(t_k)$ for $3 \leq l \leq n$. Therefore

$$F_n(v_1(t_{k+1}), v_2(t_{k+1}), \ldots, v_n(t_{k+1})) = F_n(e, f(v_1(t_k), v_2(t_k)), v_3(t_k), \ldots, v_n(t_k))$$
$$= f\big(F_2(e, f(v_1(t_k), v_2(t_k))),\; F_{n-2}(v_3(t_k), \ldots, v_n(t_k))\big)$$
$$= f\big(F_2(v_1(t_k), v_2(t_k)),\; F_{n-2}(v_3(t_k), \ldots, v_n(t_k))\big)$$
$$= F_n(v_1(t_k), v_2(t_k), \ldots, v_n(t_k)),$$

where the second and fourth equalities are due to (1) and the third equality is due to (2) together
with the identity property of $e$. This establishes the induction step and in turn the desired conclusion. □

We first consider SRW and define

$$\tau_s = \inf\{t : \text{each node becomes active at least once by time } t\}.$$

Let $a(t)$ denote the node that is active at time $t \geq \tau_s$. Since each node has been active at least
once by time $t$, every node other than $a(t)$ should have turned inactive by sending the token to a
neighbor. Therefore at each node $i \neq a(t)$ the value is $v_i(t) = e$. From Equations (1)-(2) we get

$$F_n(v_1(t), \ldots, v_n(t)) = F_2(v_{a(t)}(t), e) = v_{a(t)}(t).$$

Therefore by Lemma 3.1 the value $v_{a(t)}(t)$ of node $a(t)$ is equal to $F_n(x_1, \ldots, x_n)$.

The above argument applies verbatim to CRW also, by taking $a(t)$ to be the unique active node at
time $t \geq \tau_c$ where $\tau_c$ is defined as

$$\tau_c = \inf\Big\{t : \sum_i \xi_i(t) = 1\Big\}.$$

It should be clear that for SRW $\tau_s$ is the cover time of the communication graph $G$ and it is finite
with probability 1. Under CRW, $\sum_i \xi_i(t)$ is a non-increasing process with an absorption state at
1. In this case $\tau_c$ is the absorption time of this process and it is almost surely finite. Hence under
each algorithm the desired quantity $F_n(x_1, \ldots, x_n)$ is available at some node within finite time.

It remains to establish how the termination times $\tau_s, \tau_c$ can be recognized. Towards this end
note that the update mechanism for variable count reflects a secondary distributed computation
procedure with $f(c_i, c_j) = c_i + c_j$ and

$$F_n(c_1(0), \ldots, c_n(0)) = \sum_i c_i(0) = n.$$

The general conclusions obtained above are valid in this special case, and in turn $c_{a(t)}(t) = n$ for
$t \geq \tau_s$ under SRW and for $t \geq \tau_c$ under CRW. Therefore at such time instants node $i = a(t)$ can identify
itself as the unique bearer of the desired quantity by verifying the condition $c_i(t) = n$. In other
words variable count keeps an account of how many sensor values have been fused so far to form
the content of variable value; this serves as a pilot signal with a known terminal value that signals
the end of each algorithm. We collect these observations in the following theorem:

Theorem 3.1 Both SRW and CRW compute the exact value of $F_n(x_1, \ldots, x_n)$ in finite time. Each
algorithm terminates with the correct value when the content of variable count reaches $n$, the system
size, at some node in the system.

3.2 Time and Message Complexities 

We present execution time and message complexity for SRW and CRW. We begin this section with 
the definitions of message and time complexities. 

Definition 3.1 Average time complexity of SRW (resp. CRW) refers to $E[\tau_s]$ (resp. $E[\tau_c]$).

In adopting a measure of messaging complexity, let $\eta_s(t)$ and $\eta_c(t)$ be the total number of
transmitted messages in the network by time $t$ under algorithms SRW and CRW respectively:

Definition 3.2 Average per-node message complexity of SRW (resp. CRW) refers to $n^{-1}E[\eta_s(\tau_s)]$
(resp. $n^{-1}E[\eta_c(\tau_c)]$).

Note that $(\xi(t) : t \geq 0)$ is a Markov process under both algorithms. More precisely, $(\xi(t) : t \geq 0)$
is a random walk on $G$ under SRW, and a coalescing random walk on $G$ under CRW. In particular
the average time complexity of SRW is the mean cover time of $G$. Average message complexities
of the algorithms are characterized by the following lemma:



Lemma 3.2

$$E[\eta_s(\tau_s)] = E[\tau_s],$$
$$E[\eta_c(\tau_c)] = E\Big[\int_0^{\tau_c} \sum_i \xi_i(t)\, dt\Big].$$

Proof. We provide a proof that applies verbatim to both algorithms. Let $\eta(t)$ represent $\eta_s(t)$
for SRW and $\eta_c(t)$ for CRW. Let $\mathcal{F}_t$ denote the sigma-field generated by $(\xi(s) : s \leq t)$ and let
$(\zeta(t) : t \geq 0)$ be a Poisson process with unit rate. Note that $\eta(t)$ has the same distribution as
$\zeta\big(\int_0^t \sum_i \xi_i(s)\, ds\big)$ since each carrier node transmits messages at unit rate and inactive nodes do not
engage in message transmission, and thus $\sum_i \xi_i(t)$ is the instantaneous rate of message generation
in the network at time $t$. In particular the process $(\mu(t) : t \geq 0)$ with

$$\mu(t) = \eta(t) - \int_0^t \sum_i \xi_i(s)\, ds \qquad (3)$$

is a martingale adapted to $\{\mathcal{F}_t\}$. Both $\tau_s$ and $\tau_c$ are $\{\mathcal{F}_t\}$-stopping times and they are almost
surely finite. In addition $\sup_{t \geq 0} \sum_i \xi_i(t) \leq n$; hence it follows by the optional sampling theorem [12,
Theorem 2.2.13] that $E[\mu(\tau_s)] = E[\mu(\tau_c)] = 0$. Using this observation in equality (3) establishes
the lemma. □
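As a numerical sanity check of Lemma 3.2 for CRW, the following Python sketch compares the empirical mean number of transmitted messages with the empirical mean area under the active-node count. It is an illustration only; the helper names `complete_graph`, `crw_messages_and_area`, and `compare`, as well as the run count and seed, are assumptions made for this example.

```python
import random


def complete_graph(n):
    """Adjacency lists of the complete graph on n nodes (hypothetical helper)."""
    return [[j for j in range(n) if j != i] for i in range(n)]


def crw_messages_and_area(adj, rng):
    """Run CRW until one token remains.

    Returns (number of messages sent, integral of the active-node count over time).
    """
    n = len(adj)
    status = [True] * n
    active = list(range(n))
    messages, area = 0, 0.0
    while len(active) > 1:
        # Time to the next transmission: active nodes tick at unit rate each.
        dt = rng.expovariate(len(active))
        area += len(active) * dt
        k = rng.randrange(len(active))
        sender = active[k]
        receiver = rng.choice(adj[sender])
        messages += 1
        # The sender gives up its token; the receiver holds one afterwards.
        status[sender] = False
        active[k] = active[-1]
        active.pop()
        if not status[receiver]:
            status[receiver] = True
            active.append(receiver)
    return messages, area


def compare(adj, runs=4000, seed=1):
    """Return (mean messages, mean area) over independent CRW runs."""
    rng = random.Random(seed)
    total_messages = total_area = 0.0
    for _ in range(runs):
        m, a = crw_messages_and_area(adj, rng)
        total_messages += m
        total_area += a
    return total_messages / runs, total_area / runs
```

With enough runs the two returned averages should be close to each other, which is what the lemma predicts in expectation.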



4 Regular Topologies 

Since the two algorithms are closely related to random walks, their time and message complexities 
for special topologies of the communication graph G can be deduced by referring to related work 
in applied probability. Before giving an analysis for general topologies in the next section, we 
consider here the cases when $G$ is completely connected (i.e. a clique) and a d-dimensional torus for
$d \geq 1$. Recall that the average time complexity of SRW is the mean cover time of $G$ and by the above
lemma this is proportionally related to the mean message complexity. We refer the reader to [2,
Chapter 5] for the cover time results, which are summarized in Table 1 in the column SRW. We
elaborate on the time/message complexity of CRW in more detail, as that entails consideration of
coalescing random walks, which are relatively obscure in engineering applications.

Completely connected graph: A completely connected graph is a graph where each vertex has
an edge with every other vertex. In such graphs the process $(\sum_i \xi_i(t) : t \geq 0)$ is also Markovian.
This follows from the fact that the transition matrix at any time instant is invariant to permutation.
For CRW this process has initial state $\sum_i \xi_i(0) = n$ and at time $t < \tau_c$ it decreases by one at
an instantaneous rate proportional to the number of edges that are incident on two active nodes
at time $t$. This process has been
studied in detail in [23] and the results therein are summarized in Table 1. While a completely
connected graph reflects a limited set of cases of practical importance, its analysis interestingly
sheds considerable light on mesh-type topologies that are considered next.
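For orientation, the completely connected case admits a short direct calculation. The derivation below is a sketch under the assumptions of Section 3 (unit-rate Poisson clocks, neighbor chosen uniformly among the $n-1$ other nodes) and is not taken from [23]; the symbol $k$ for the current number of active nodes is introduced here for illustration. The last line uses Lemma 3.2.

```latex
% Assumptions: complete graph on n nodes, unit-rate clocks, uniform neighbor choice,
% k = \sum_i \xi_i(t) active nodes at the current time.
\begin{align*}
\text{rate}(k \to k-1) &= k \cdot \frac{k-1}{n-1}, \\
E[\tau_c] &= \sum_{k=2}^{n} \frac{n-1}{k(k-1)} = (n-1)\Big(1 - \frac{1}{n}\Big), \\
E[\eta_c(\tau_c)] &= \sum_{k=2}^{n} k \cdot \frac{n-1}{k(k-1)} = (n-1) \sum_{j=1}^{n-1} \frac{1}{j}.
\end{align*}
```

Under these assumptions the mean completion time grows linearly in $n$, while the mean per-node message count grows like $\log n$.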



Ring and d-dimensional torus: A d-dimensional torus is a graph where all the vertices have
exactly 2d neighbors and it can be formed by joining the facing boundaries of a grid hence yielding
a completely symmetric structure. A ring is simply a 1-dimensional torus. Consider $n = N^d$ for a
d-dimensional torus. Let

$$s_N = \begin{cases} N^2 & \text{if } d = 1 \\ N^2 \log N & \text{if } d = 2 \\ N^d & \text{if } d \geq 3. \end{cases}$$

An asymptotic analysis of coalescing random walks on the d-dimensional torus for large $N$ is given
by Cox [8]. It is established that the time-scaled process $\sum_i \xi_i(s_N t)$ on a d-dimensional torus
converges in distribution to its counterpart on a completely connected graph as $N \to \infty$. (Note that
$\sum_i \xi_i(t)$ is not even Markovian unless $G$ is completely connected.) This result in turn leads to the
following theorem on the time complexity for CRW:

Theorem 4.1 [8, Theorem 6]

$$0 < \liminf_{N \to \infty} E[\tau_c]/s_N \leq \limsup_{N \to \infty} E[\tau_c]/s_N < \infty.$$



The average message complexity of CRW can also be quantified by building on the asymptotic
characterization of [8]. Towards that end consider a typical sample path of the process $(\sum_i \xi_i(t) :
t \geq 0)$ illustrated in Figure 3. By Lemma 3.2 the average aggregate message complexity of the
algorithm is the mean of the shaded area under the trajectory of $\sum_i \xi_i(t)$, $0 \leq t \leq \tau_c$. The average
per-node message complexity is then this quantity divided by the total number of nodes $n$.

Let

$$m_N = \begin{cases} N^2 & \text{if } d = 1 \\ N^2 (\log N)^2 & \text{if } d = 2 \\ N^d \log N & \text{if } d \geq 3. \end{cases}$$

The following theorem provides an upper bound for the message complexity of CRW.

Theorem 4.2 [21, Theorem 7]

$$\limsup_{N \to \infty} E[\eta_c(\tau_c)]/m_N < \infty.$$

Figure 3: A sample path of the number of active nodes under algorithm CRW. $\sigma_k$ denotes the first time that
$k$ active nodes remain in the network so that $\tau_c = \sigma_1$. The mean of the shaded area is the mean aggregate
number of transmitted messages in the network.



To complement Theorem 4.2 a lower bound on the growth rate of $E[\eta_c(\tau_c)]$ is provided by Theorem 4.1:
Since $\sum_i \xi_i(t) \geq 1$ for all $t$ it follows that $E[\eta_c(\tau_c)] \geq E[\tau_c]$; in turn

$$\liminf_{N \to \infty} E[\eta_c(\tau_c)]/s_N \geq \liminf_{N \to \infty} E[\tau_c]/s_N > 0.$$

By comparing $s_N$ and $m_N$ we conclude that this bound is tight for a ring, i.e. a 1-d torus, and
it is off by at most a factor $\log N$ for $d \geq 2$. For a relatively concrete view of these quantities
Figure 4 provides a summary of numerical simulations for time and message complexities of CRW
on 2-dimensional tori of varying sizes.

5 Two-Phase Algorithms

The algorithm works as follows: In the first phase the nodes in the network follow the CRW
protocol up to some designated deterministic time $t$. At this time there are $\eta_c(t)$ tokens left in the
system. The set of nodes that have tokens at time $t$ then flood their messages to all the nodes
as described below.

The Controlled Flooding (CFLD) algorithm assumes no network topology. It works by forwarding the
message over all links. From the source node the message is sent to all neighbors. Each node $v$
receiving its first message from vertex $u$ sends messages to all neighbors except $u$. Also, each node
transmits its packet at most once [6]. If no message is received then it does nothing. It is well
known that the message complexity scales as $\Omega(|E|)$ since each edge delivers the message either
once or twice [6]. The time complexity scales as $\Omega(\mathrm{diam}(G))$ since we must reach all nodes [14].
We can adapt the CFLD algorithm to our scenario where we have a random number $\eta_c(t)$ of tokens left
in the system. At time $t$ all nodes cease to implement CRW. Nodes with tokens separately flood their
messages through CFLD. Each node then fuses the data once all the messages are received.
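The flooding rule described above (forward the first received copy to all neighbors except the one it came from, and transmit at most once) can be sketched in Python for a single source. This is a simplified illustration under the assumption of synchronous rounds; the names `ring` and `controlled_flood` are hypothetical, and the additional CFLD control rules of [6] are not modeled.

```python
from collections import deque


def ring(n):
    """Adjacency lists of an n-node ring (hypothetical helper for the example)."""
    return [[(i - 1) % n, (i + 1) % n] for i in range(n)]


def controlled_flood(adj, source):
    """Flood one message from `source` over a connected graph.

    Returns (messages sent, rounds until every node is informed, informed nodes).
    """
    first_heard_from = {source: None}  # neighbor that delivered the first copy
    informed_round = {source: 0}       # round in which each node was first reached
    queue = deque([source])            # nodes that still have to transmit once
    messages = 0
    while queue:
        v = queue.popleft()
        for w in adj[v]:
            # Do not send the message back to the node it was first received from.
            if w == first_heard_from[v]:
                continue
            messages += 1
            if w not in first_heard_from:
                first_heard_from[w] = v
                informed_round[w] = informed_round[v] + 1
                queue.append(w)
    return messages, max(informed_round.values()), len(informed_round)
```

In this sketch every edge carries the message once or twice, so the message count lies between $|E|$ and $2|E|$, and the number of rounds equals the largest hop distance from the source.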



There are two fundamental questions that arise:

(1) How do nodes realize that they have all the messages?

(2) Can we ensure that consensus is achieved through this process?

The answer to the second question lies in Lemma 3.1, which asserts that the data fusion is invariant to the order of reception. Consequently, we are left to address the first question. Here we invoke the secondary distributed computation scheme described in Figure 1 and Section 3. Each active node at time $t$ has its individual variable count. During the CFLD phase each node forwards this count variable in addition to its fused value. Each node can then determine whether it has received all the messages by updating its private count variable.

There are three principal advantages for combining CRW and CFLD: 

(A) Consensus is obtained in finite time. 

(B) CRW slows down when there are few tokens increasing time complexity. The two phase 
algorithm substantially improves time complexity. 

(C) Analytical bounds for message and time complexities for general graphs can be established. 
This is because it is easier to determine the expected number of tokens left in the system at any 
time. 

Next we present the message and time complexity of the combined algorithm. Recall the n-node communication graph $G = (V, E)$ with link set $E$ and node set $V$. Let $d_v$ denote the degree of node $v$ and $\mathrm{diam}(G)$ denote the diameter of the graph.

To simplify the exposition we denote

$$N(t) = E\Big[\sum_{i=1}^{n} \xi_i(t)\Big]. \qquad (4)$$

Note that $1 \le N(t) \le n$ and $N(0) = n$, and $\xi_i(t)$ is, as before, the state of node $i$ at time $t$.

We compute the time it takes for the expected number of active tokens to fall below some positive integer $\gamma$. This leads to the following definition.

Definition 5.1 The $\gamma$-time complexity is the time $T_\gamma$ it takes for the CRW on an n-node graph to have an average of $\gamma$ active tokens left in the system, i.e.,

$$T_\gamma = \min\{t \ge 0 \mid N(t) \le \gamma\}, \quad \gamma = 2, 3, \ldots$$

Observe that unlike the termination time $\tau_c$ defined in Section 3.2, $T_\gamma$ is no longer a random variable, which, as we will see in Section 6, simplifies our analysis.

Define the message complexity for the CRW until time $t$:

$$M_c(t) = \int_0^t N(s)\, ds. \qquad (5)$$

The time complexity, $T$, is the sum of the time complexities corresponding to the two phases. Consequently, for the case when all the tokens during the CFLD phase are transmitted at a unit rate we have:

$$T \le T_\gamma + O(\mathrm{diam}(G)). \qquad (6)$$
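To make these definitions concrete, the following sketch evaluates $T_\gamma$, the CRW message complexity and the two-phase time bound from a given sequence of expected token counts. It assumes a discrete-time setting in which `N[t]` holds $N(t)$ for $t = 0, 1, \ldots$ and the integral is replaced by a sum; the function names are hypothetical.

```python
def gamma_time(N, gamma):
    """Smallest step t with N[t] <= gamma, or None if never reached."""
    for t, value in enumerate(N):
        if value <= gamma:
            return t
    return None


def crw_message_complexity(N, t):
    """Discrete analogue of M_c(t): the sum of N[s] for s = 0, ..., t."""
    return sum(N[: t + 1])


def two_phase_time(N, gamma, diameter):
    """T_gamma plus a flooding phase of at most `diameter` unit-rate steps."""
    return gamma_time(N, gamma) + diameter
```

For example, with the made-up sequence `N = [16, 9, 5, 3, 2, 1.5]` and `gamma = 3`, the first function returns step 3 and the second returns 33.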



Theorem 5.1 The overall message complexity, $M(t)$, for the two-phase scheme where nodes follow CRW up to time $t$ and CFLD after time $t$ is less than $M_c(t) + 2(\sum_{v \in V} d_v) N(t)$. For $t = T_\gamma$, the overall message complexity is less than $M_c(T_\gamma) + 2(\sum_{v \in V} d_v) N(T_\gamma)$.

Proof. Suppose the CFLD phase starts at time $t$. Then the overall number of messages, $\eta(t)$, is the sum of the messages transmitted during the CRW phase, $\eta_c(t)$, and those transmitted during the CFLD phase starting at time $t$, $\eta_F(t)$. Specifically, we have $\eta(t) = \eta_c(t) + \eta_F(t)$. Taking expectations on both sides we obtain $E(\eta(t)) = M_c(t) + E(\eta_F(t))$. We can simplify the expression for the second phase by noting that (see [6]) $E(\eta_F(t)) \le 2(\sum_{v \in V} d_v) N(t)$. The proof now follows by substitution.
□

6 Time and Message Complexities for General Graphs 

This section describes techniques for estimating the message and time complexity of CRW for general graphs in both continuous and discrete time settings. Bounds for SRW reduce to the computation of cover times. Cover time bounds for many of the graphs considered in this paper are available in the literature and we do not develop these results here.

To develop results for CRW we follow the two-phase procedure outlined in the previous section. The advantages of the two-phase algorithm have already been outlined in Section 5. We recall one main advantage that is pertinent here: from Theorem 5.1 it follows that we do not have to seek bounds for the stopping time $\tau_c$. Rather, we only need to determine the expected number of active tokens at a deterministic time $t$.

This section is organized as follows. First we establish straightforward results for general graphs based on bounds on the worst-case mean hitting time. We show that the number of active tokens at time $t$ decays as $O(n \exp(-t/\sigma))$, where $\sigma$ is the worst-case mean hitting time on the graph. We then bound this hitting time through a network-circuit resistance analogy and compute complexity bounds for a number of graphs such as expanders and meshes. While this bound is general, it turns out to be conservative in estimating the message complexity. The main reason is that the worst-case mean hitting time is generally large for many graphs and a local analysis is required. This motivates a careful study of message complexity based on local analysis of random walks. Specifically, we consider graphs with geometric structure. We show that for such graphs the per-node message complexity scales as $O(\log^2(n))$, paralleling our results for the torus in Section 4.

As before let $X_t$ denote the state of a random walk on a graph $G = (V, E)$. For continuous time we consider unit rate random walks and assume that multiple random walks are independent. Analogously we also consider discrete time independent simple symmetric random walks and allow self-loops in the graphs. Our definitions and results typically apply to both continuous and discrete setups based on the so-called jump-and-hold description and a continuization argument as described in [2]. Nevertheless, wherever appropriate we will point out specifically whether our results apply to discrete or continuous time scenarios.

Analogous to Equation 5 for the continuous time setup, the message complexity in the discrete setup is given by

$$M_c(t) = \sum_{s=0}^{t} N(s). \qquad (7)$$

We denote the first hitting time of node $v$ by $\tau_v$, i.e., $\tau_v = \inf\{t \ge 0 \mid X_t = v\}$. We also denote by $\tau_{vw}$ the hitting time for a random walk starting at $v$ and hitting $w$ for the first time:

$$\tau_{vw} = \inf\{t \ge 0 \mid X_0 = v,\ X_t = w\}.$$

The worst-case mean hitting time is denoted by $\sigma$, i.e.,

$$\sigma = \max_{v,w} E(\tau_{vw}).$$

Let $C_{vw}$ be the first time that two independent unit rate continuous time random walks $X_t, Y_t$ on graph $G = (V, E)$, started at nodes $v$ and $w$, coalesce (meet), i.e.,

$$C_{vw} = \inf\{t \ge 0 \mid X_t = Y_t,\ X_0 = v,\ Y_0 = w\}.$$

The meeting time of two independent copies of a random walk in continuous time is related to the worst-case mean hitting time. Specifically, Aldous [2] (Proposition 5, Chap 14) uses martingale arguments to show that

$$\max_{v,w} E(C_{vw}) \le \sigma. \qquad (8)$$

Let $\alpha_s(A)$ denote the worst-case coalescing time probability on a subset $A \subset V$, i.e.,

$$\alpha_s(A) = \min_{v,w \in A} \mathrm{Prob}(C_{vw} \le s). \qquad (9)$$

Since the minimum in Equation 9 is taken over a smaller set of pairs when $A \subset V$, we obtain $\alpha_s(A) \ge \alpha_s(V)$. The Markov inequality together with Equation 8 then gives a bound on the meeting time probability, i.e.,

$$\alpha_s(A) \ge \alpha_s(V) \ge 1 - \frac{\sigma}{s}. \qquad (10)$$

Footnote: Specifically, as described in [2], the continuous walk can be constructed by a two-step procedure: (1) run a discrete time chain with the simple symmetric transition matrix; (2) given the sequence of states $v_j \in V$, $j = 1, 2, \ldots, m$, visited by the discrete time chain, the duration of time spent at each state $v_j$ is a unit rate exponentially distributed random variable. This continuization is particularly useful since quantities such as mean hitting times in the continuous case correspond directly to the mean number of discrete time steps required in the discrete time chain.

We next consider a decomposition of the original graph into disjoint subgraphs and bound the total coalescing time by the union of the coalescing times for the subgraphs. Let $\lfloor t \rfloor$ denote the greatest integer not exceeding $t$. Suppose $A_i$, $i = 1, 2, \ldots, m(t)$, is a partition of the vertices of the graph and $\mathcal{A}_t$ denotes the collection, i.e.,

$$\bigcup_{i=1}^{m(t)} A_i = V, \quad A_i \cap A_j = \emptyset,\ i \ne j, \quad \mathcal{A}_t = \{A_1, A_2, \ldots, A_{m(t)}\}.$$

The worst-case coalescing time probability, $\alpha_s(\mathcal{A}_t)$, over this collection is defined by

$$\alpha_s(\mathcal{A}_t) = \min_{1 \le j \le m(t)} \alpha_s(A_j). \qquad (11)$$

Theorem 6.1 Consider the partition of the graph into subsets $\{A_k\}$ as described above. Suppose $1 \le m(t) \le \lfloor N(t)/2 \rfloor$, i.e., the number of partitions is smaller than one half the expected number of active tokens at time $t$. It follows for both continuous and discrete time setups that

$$N(t+s) \le N(t) \exp\left(-\tfrac{1}{2}\,\alpha_s(\mathcal{A}_t)\right); \quad 0 \le s \le t,\ N(t) \ge 2. \qquad (12)$$

Furthermore, suppose $t \le r \le r + s \le 2t$ and the number of partitions is chosen such that $1 \le m(t) \le \lfloor N(t)/4 \rfloor$ and $N(t) \le 2N(2t)$. Then it follows that

$$N(2t) \le N(t) \exp\left\{-\frac{t}{2s}\,\alpha_s(\mathcal{A}_t)\right\}; \quad 0 \le s \le t,\ N(t) \ge 2. \qquad (13)$$

The proof of the theorem appears in the appendix and is based on the arguments presented in 
Cox [8] for the torus. We exploit the salient steps there to extend it to general graphs. 

Observe that if the coalescing time of two walks is a constant then the number of active tokens decreases exponentially fast. However, the meeting time can be large, i.e., the probability that two walks meet in a short time can be very small. Note that since $0 \le \alpha_s(\mathcal{A}_t) \le 1$, the right-hand side of Equation 12 is at least $N(t)/\sqrt{e}$. Consequently, Theorem 6.1 is not useful for large incremental times $s$. Therefore, this result will be used as an intermediate step in an iterative process over many increments to provide useful bounds.

We now use Theorem 6.1 to establish the $\gamma$ time and message complexities for arbitrary connected graphs. We have the following theorem.

Theorem 6.2 Consider the algorithm CRW on an arbitrary connected graph $G = (V, E)$. The $\gamma$ time complexity for $\gamma \ge 2$ scales as $O(\sigma \log(n/\gamma))$. The $\gamma$ message complexity for $\gamma \ge 2$ scales as $O(n \sigma \log(n/\gamma))$.

Footnote: The $O(\cdot)$ notation here and in the rest of this section for time and message complexity implies that the bound holds for sufficiently large time for a fixed n-node graph.

Proof. In Theorem 6.1 we choose a single partition, i.e., $\mathcal{A}_t = \{V\}$. For this case the worst-case meeting time probability is $\alpha_s(\mathcal{A}_t) = \alpha_s(V)$. Consequently, we can apply the Markov inequality described by Equation 10 to obtain $\alpha_s(V) \ge 1 - \sigma/s$. This bound only makes sense if $s > \sigma$. We choose time increments $s = 2\sigma$ and partition $T = 4\sigma \log(n/\gamma)$ into $2\log(n/\gamma)$ increments. For each increment $s$ we obtain from Equation 13 that

$$N(t+s) \le N(t) \exp(-(1 - \sigma/s)) = N(t) \exp(-1/2).$$

Repeating this $2\log(n/\gamma)$ times we get

$$N(T) \le N(0)\,(\exp(-1/2))^{2\log(n/\gamma)} = \gamma,$$



where the last equality follows from the fact that $N(0) = n$. The $\gamma$ message complexity directly follows from Equation 5. □

In the next section we apply Theorem 6.2 to specific graphs to obtain bounds on time and message complexities.

6.1 Time and Message Complexity Based on Hitting Time Characterization

Our goal in this section is to combine well-known bounds on hitting times for some standard graphs with Theorem 6.2.

For general graphs Aleliunas et al. [3] showed a general upper bound $\sigma = O(|E||V|)$ for the worst-case hitting time, where $|E|$ is the number of edges and $|V|$ is the number of nodes (vertices). If the maximal degree of the graph is $D_{\max}$ then $|E| \le n D_{\max}$ and $|V| = n$. This implies that the $\gamma$ time complexity scales as

$$T_\gamma = O(n^2 D_{\max} \log(n/\gamma)).$$

We note that this result is generally conservative in comparison to the time complexity of the 2D torus described in the previous sections, because the hitting time bound itself is conservative. We invoke the resistance characterization of hitting time to obtain sharper bounds.

6.1.1 Resistance Characterization for Connected Graphs

Chandra et al. [9] establish bounds for the hitting time between any two nodes based on the resistance of electrical networks. The resistance bounds apply to discrete time walks. However, there is a close relationship between discrete and continuous time random walks through the jump-and-hold description described earlier, which yields similar results for continuous time with appropriate time scaling.

The electrical network is obtained by replacing each edge in the graph with a one-ohm resistor. It turns out that the worst-case mean hitting time satisfies

$$\sigma \le \max_{u,v \in V} 2|E|\,\rho_{uv}, \qquad (14)$$

where $\rho_{uv}$ is the effective resistance between nodes $u$ and $v$. Consequently, if $D_{\max}$ is the maximum degree of the graph and $\rho^*$ is the maximum effective resistance between any two nodes in the network, we get

$$T_\gamma \le 2n \log(n/\gamma)\, D_{\max}\, \rho^*. \qquad (15)$$

Expander Graphs: An $(n, D_{\max}, \alpha)$ expander is a graph $G = (V, E)$ on $n$ vertices of maximal degree $D_{\max}$ such that every subset $A \subset V$ satisfying $|A| \le n/2$ has $|N(A) - A| \ge \alpha |A|$, where

$$N(A) = \{v \in V \mid (u, v) \in E,\ u \in A\}. \qquad (16)$$

For an $(n, D_{\max}, \alpha)$ expander graph with minimum degree $D_{\min}$ the worst-case resistance is bounded as

$$\rho^* \le \frac{24}{\alpha^2 (D_{\min} + 1)}.$$

Consequently, the $\gamma$ time complexity scales as

$$T_\gamma = O\left(\frac{n \log(n/\gamma)\, D_{\max}}{\alpha^2 (D_{\min} + 1)}\right).$$

For an expander graph where $D_{\min} \approx D_{\max}$ we get a $\gamma$ time complexity scaling as

$$T_\gamma = O(n \log(n/\gamma)).$$

We note that this result is close to the time complexity bounds obtained for a completely connected network in the previous section.

2D Mesh: From the resistance calculations it turns out that

$$\rho^* = \Theta(\log(n))$$

for the 2D mesh [9] with $n$ nodes. Consequently, for the 2D mesh we obtain

$$T_\gamma = O(n \log^2(n/\gamma)), \quad \gamma \ge 2.$$

This is within a $\log(n)$ factor of the bound obtained for the 2D torus using more elaborate martingale calculations in the previous section. Note that unlike the 2D torus the 2D mesh is not symmetric, so the results of the previous section cannot be directly applied here.
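The resistance-based hitting time bound used in this section can be evaluated numerically on small graphs. The sketch below is not from the paper: it assumes a connected simple graph given as an adjacency dictionary with one-ohm edges, grounds one node, injects a unit current at the other, and solves the reduced Laplacian system by plain Gauss-Jordan elimination. The names `effective_resistance` and `hitting_time_bound` are hypothetical.

```python
def effective_resistance(adj, u, v):
    """Effective resistance between distinct nodes u and v (one-ohm edges)."""
    nodes = [w for w in adj if w != v]  # node v is grounded
    index = {w: i for i, w in enumerate(nodes)}
    m = len(nodes)
    # Augmented system [L_reduced | b] with a unit current injected at u.
    A = [[0.0] * (m + 1) for _ in range(m)]
    for w in nodes:
        i = index[w]
        A[i][i] = float(len(adj[w]))
        for x in adj[w]:
            if x != v:
                A[i][index[x]] -= 1.0
    A[index[u]][m] = 1.0
    # Gauss-Jordan elimination with partial pivoting.
    for col in range(m):
        pivot = max(range(col, m), key=lambda r: abs(A[r][col]))
        A[col], A[pivot] = A[pivot], A[col]
        for r in range(m):
            if r != col and A[r][col] != 0.0:
                factor = A[r][col] / A[col][col]
                for c in range(col, m + 1):
                    A[r][c] -= factor * A[col][c]
    i = index[u]
    return A[i][m] / A[i][i]  # potential at u equals the resistance


def hitting_time_bound(adj):
    """Evaluate 2 |E| max_{u,v} rho_uv, the right-hand side of the bound."""
    num_edges = sum(len(nbrs) for nbrs in adj.values()) // 2
    nodes = list(adj)
    rho_max = max(effective_resistance(adj, a, b)
                  for k, a in enumerate(nodes) for b in nodes[k + 1:])
    return 2 * num_edges * rho_max
```

For a three-node path the end-to-end resistance is 2 ohms and the bound evaluates to 8, which indeed dominates the end-to-end mean hitting time of 4 steps. The cubic-time elimination is chosen for self-containment, not efficiency.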

Random Geometric Graphs (RGG): A 2D Random Geometric Graph with $n$ nodes and radius $r(n)$ is a graph $G = (V, E)$ whose nodes are uniformly distributed in the unit square and where $(u, v) \in E$ if and only if the Euclidean distance between nodes $u$ and $v$ is smaller than or equal to $r(n)$.

It is well known that when the radius of connectivity is chosen as $r(n) = \sqrt{2 \log n / n}$, the graph is connected with high probability. Furthermore, Avin and Ercal [4] (Theorem 5.3) show that, with high probability, the resistance scales as

$$\rho^* = O\left(\frac{1}{n r^2(n)}\right)$$

and the number of edges scales as $|E| = O(n^2 r^2(n))$ (see [4] Corollary 3.5) for this choice of connectivity radius. Consequently, the worst-case mean hitting time scales as $\sigma = O(n)$ with high probability, and by Theorem 6.2 the $\gamma$ time complexity scales as $T_\gamma = O(n \log(n/\gamma))$.

While the time-complexity bounds obtained using the resistance characterization appear to be tight for several cases, the $\gamma$ message complexity is overly conservative. This is because the worst-case mean hitting time $\sigma$ is $\Omega(n)$ in general. Theorem 6.2 implies that the $\gamma$ message complexity scales as $O(n^2 \log(n))$ even for a 2D torus. This is significantly weaker than the complexity bounds obtained for the torus in Section 3.2. Motivated by these reasons we develop a new characterization of message and time complexities based on local geometric analysis of random walks.

6.2 Logarithmic Bounds for Message Complexity 

The main conservatism in Theorem 6.2 arises from the fact that the meeting time is bounded in 
terms of the worst-case hitting time. Specifically, if two random walks start relatively close to each 
other we expect that the meeting time is relatively small, i.e., the meeting time should typically 
scale with initial distance between the two walks. In this section we develop these ideas further 
for graphs that have a geometric neighborhood structure. We focus on discrete time walks since the analysis is technically simpler. Each active token follows an independent, simple, symmetric random walk on the graph $G = (V, E)$. Specifically, at each step an active token moves to a neighbor of its current location, chosen uniformly at random, and the moves of all the active tokens are synchronized (this assumption is not restrictive since we allow self-loops).
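The synchronized discrete-time token dynamics can be explored with a small simulation. The sketch below is illustrative only and rests on several assumptions: the graph is a square torus with a self-loop at every node, tokens that land on the same node in the same step merge into one, and tokens that merely cross on an edge do not merge. The names `torus_with_self_loops` and `simulate_crw` are hypothetical.

```python
import random


def torus_with_self_loops(side):
    """Adjacency dict of a side x side torus with a self-loop per node."""
    adj = {}
    for r in range(side):
        for c in range(side):
            adj[(r, c)] = [
                (r, c),                      # self-loop (token stays put)
                ((r + 1) % side, c),
                ((r - 1) % side, c),
                (r, (c + 1) % side),
                (r, (c - 1) % side),
            ]
    return adj


def simulate_crw(adj, rng, max_steps=100000):
    """Return the per-step counts of active tokens, starting from n."""
    tokens = set(adj)  # one active token at every node initially
    counts = [len(tokens)]
    while len(tokens) > 1 and len(counts) <= max_steps:
        # All tokens move at once; tokens landing on one node coalesce.
        tokens = {rng.choice(adj[v]) for v in tokens}
        counts.append(len(tokens))
    return counts
```

Averaging `counts` over many runs gives an empirical estimate of $N(t)$, and summing one run's counts gives the corresponding sample of the discrete message complexity. The self-loops matter here: on a bipartite torus without them, two tokens at odd distance would never occupy the same node at the same step.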

The basic idea is based on local behavior of random walks. Specifically, it turns out that for 
graphs that are endowed with a geometric neighborhood structure it is possible to characterize the 
probability that two random walks meet in terms of their initial graph distance. We emphasize that 
while in general there is always a non-zero probability that two random walks meet, this probability 
has often been characterized in terms of the entire graph. Indeed this was the basic reason for the 
conservatism of resistance based bounds derived in the previous section. Therefore, to overcome 
this issue we will develop results based on local behavior of random walks. Our main result in this 
section (see Theorem 6.3) will establish that under certain regularity conditions on the graph the 
expected number of active tokens at time step $t$ decays inversely with $t$, i.e.,

$$N(t) = O\left(\frac{n \log(1+t)}{t}\right). \qquad (17)$$

We again emphasize that the $O(\cdot)$ notation above and in the rest of this section refers to time asymptotics for a fixed n-node graph. The result implies a bound on both the $\gamma$ time complexity and the $\gamma$ message complexity. The $\gamma$ time complexity scales as

$$T_\gamma = O\left(\frac{n \log(n)}{\gamma}\right), \quad \gamma = 2, 3, \ldots$$

Note that the $\gamma$ time complexity bounds are order-wise similar to those derived using resistance arguments in the previous section. However, the main advantage here is that we can now obtain a bound on message complexity based on Equations 7 and 21:

$$M_c(T_\gamma) \le \sum_{t=1}^{T_\gamma} \frac{C\, n \log(1+t)}{t} = O(n \log^2(n)). \qquad (18)$$

Thus the message complexity per node scales as $M_c(T_\gamma)/n = O(\log^2(n))$.
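The squared-logarithm growth of the per-node message count can be checked numerically by summing $n \log(1+t)/t$ directly. The sketch below sets the unspecified constant to 1 and treats the horizon `T` as a free parameter; both choices, and the function name, are assumptions made for illustration.

```python
import math


def message_sum(n, T):
    """Evaluate sum_{t=1}^{T} n * log(1 + t) / t (constant factor set to 1)."""
    return sum(n * math.log(1 + t) / t for t in range(1, T + 1))


def log_squared_envelope(T):
    """Elementary upper bound log(1 + T) * (1 + log T) on the per-node sum."""
    return math.log(1 + T) * (1 + math.log(T))
```

Since $\log(1+t) \le \log(1+T)$ for $t \le T$ and the harmonic sum is at most $1 + \log T$, `message_sum(1, T)` never exceeds `log_squared_envelope(T)`, which grows as $\log^2 T$.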

This result is based on the fact that for many graphs the probability that two walks at a distance $R$ meet in time $R^2$ is bounded from below by $1/\log(R)$. To precisely describe these ideas we introduce some notation. Let $d(u, v)$ be the graph distance between the nodes $u, v \in V$, i.e., the minimal number of edges in any path connecting $u$ and $v$. We denote by $B(u, R)$ the ball centered at node $u$ with radius $R$, i.e.,

$$B(u, R) = \{v \in V \mid d(u, v) \le R\}.$$

The volume of a set $A \subset V$, denoted by $Vol(A)$, is the number of edges contained in the set. The volume of the ball $B(u, R)$ is denoted by $Vol(u, R)$ for simplicity. Note that if $d_v$ is the degree of node $v$ then we have

$$Vol(u, R) = Vol(B(u, R)) = \sum_{v \in B(u, R)} d_v.$$
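Balls and volumes in the graph metric are straightforward to compute with a truncated breadth-first search. The following sketch assumes an adjacency-dictionary representation and uses a torus without self-loops as a test graph; the helper names are hypothetical.

```python
from collections import deque


def torus_graph(side):
    """Adjacency dict of a side x side torus (no self-loops)."""
    return {
        (r, c): [((r + 1) % side, c), ((r - 1) % side, c),
                 (r, (c + 1) % side), (r, (c - 1) % side)]
        for r in range(side) for c in range(side)
    }


def ball(adj, u, R):
    """Set of nodes within graph distance R of u."""
    dist = {u: 0}
    queue = deque([u])
    while queue:
        x = queue.popleft()
        if dist[x] == R:
            continue  # do not expand beyond radius R
        for y in adj[x]:
            if y not in dist:
                dist[y] = dist[x] + 1
                queue.append(y)
    return set(dist)


def volume(adj, u, R):
    """Sum of degrees over the ball B(u, R)."""
    return sum(len(adj[v]) for v in ball(adj, u, R))
```

On a 7 x 7 torus the ball of radius 2 contains 13 nodes and has volume 52, against volume 20 at radius 1, which illustrates the roughly quadratic growth that the geometric neighborhood conditions formalize.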

Next we denote by $P(u, v)$ the 1-step transition probability of going from node $u$ to node $v$. Since we consider simple symmetric random walks, this transition probability is the inverse of the degree of node $u$ if $u$ and $v$ are connected and zero otherwise. We also use $P_t(u, v)$ to denote the t-step transition probability of going from $u$ to $v$. We next present a precise characterization of when Equation 17 holds. We will see that this bound holds when one has a geometric neighborhood structure as described below:

Definition 6.1 A graph $G = (V, E)$ is said to satisfy a geometric neighborhood structure if there exist constants $C_0, C_1$ such that

$$C_0 R^2 \le |B(u, R)|; \quad |B(u, R)^c \cap B(u, R + \Delta)| \le C_1 \Delta R, \quad \forall\, u \in V,\ 0 \le \Delta \le R, \qquad (19)$$

where $0 \le R \le R_{\max}$ and $R_{\max}$ is the diameter of the graph.

Typically a graph that is approximately regular and has a geometric neighborhood structure satisfies such a property. The random geometric graph described earlier asymptotically satisfies the geometric neighborhood property. Indeed, due to the uniform distribution of the nodes in the unit square this property is satisfied for sufficiently large $n$ with high probability (see [4] for more details). The theorem below will evidently require only the lower bound. However, it turns out that to ensure a suitable bound on the meeting time probability the upper bound will also be necessary.

Theorem 6.3 Suppose the graph $G = (V, E)$ has a geometric neighborhood structure as described in Equation 19 and the meeting time probability satisfies

$$\alpha_{t^2}(B(u, t)) \ge \frac{C_2}{\log(t)}, \quad t > 0,\ u \in V, \qquad (20)$$

for some constant $C_2$ independent of time $t$. Note that $\alpha_{t^2}(B(u, t))$ is the meeting time probability (see Equation 9) for any two walks starting in the ball $B(u, t)$ in time $t^2$. Then the expected number of active tokens at time step $t$ satisfies

$$N(t) \le C \left(\frac{n \log(t+1)}{t}\right), \quad t \ge 1, \qquad (21)$$

where C = ^ max(8, ^^^^) when the number of active tokens is greater than 4. 

Note that the smaller the constant $C_0$, the larger the number of active tokens at time $t$. We are now left to determine the conditions under which Equation 20 is satisfied. Surprisingly, it turns out that the logarithmic bound holds if:

(a) The t-step transition probability is approximately Gaussian.

(b) The geometric neighborhood property as described in Theorem 6.3 holds.

This result is stated below.

Lemma 6.1 Consider the graph $G = (V, E)$ satisfying the geometric neighborhood property as described in Equation 19. Suppose the t-step transition probability satisfies the so-called Gaussian bound, i.e.,

$$\frac{C_3}{t} \exp\left(-\frac{d^2(u, v)}{C_4 t}\right) \le P_t(u, v) + P_{t+1}(u, v); \quad \forall\, u, v \in V,\ 1 \le d(u, v) \le t, \qquad (22)$$

where $C_3, C_4$ are positive constants independent of time. Then the meeting time probability satisfies Equation 20 for some suitable constant $C_2$. Consequently, these conditions also imply the $\gamma$ message complexity bound described by Equation 21.

Note that the Gaussian t-step transition estimate bounds the sum of the transition probabilities at $t$ and $t + 1$. For bipartite graphs we must have either $P_t$ or $P_{t+1}$ equal to zero. Therefore, we cannot hope to improve this situation in general. However, if each node has a self-loop it turns out that we can lower bound the t-step transition probability directly, i.e., for non-bipartite graphs we have

$$\frac{C_3}{t} \exp\left(-\frac{d^2(u, v)}{C_4 t}\right) \le P_t(u, v); \quad \forall\, u, v \in V,\ 1 \le d(u, v) \le t. \qquad (23)$$

Our problem now reduces to finding those graphs that satisfy the t-step Gaussian transition property. It turns out that weak homogeneity conditions lead to the Gaussian t-step transition property. We describe these conditions next.

Volume Doubling Property: A graph $G = (V, E)$ is said to satisfy the volume doubling property if the volume of a ball centered at any point $u$ with increasing radius satisfies

$$Vol(u, 2R) \le C_5\, Vol(u, R), \quad \forall\, u \in V,\ R > 0. \qquad (24)$$

We again point out that a 2D mesh satisfies such a property. The volumes at graph radii $R$ and $2R$ are proportional to $R^2$ and $4R^2$ respectively for any $R$. A similar result holds for random geometric graphs (RGG) due to the so-called geo-dense property [4]. For a constant $\mu \ge 1$, a graph is said to be $\mu$-geo-dense if every square bin of size $A \ge r^2(n)/\mu$ (in the unit square) has $\Theta(nA)$ nodes. Recall from Section 6.1.1 that any two nodes at a Euclidean distance of at most $r(n)$ are connected. Lemma 3.2 of [4] shows that with high probability if $r^2(n) = c \mu \log(n)/n$ then the RGG is $\mu$-geo-dense. Furthermore, if the RGG is $\mu$-geo-dense then (i) each node $v$ has degree $d_v = \Theta(n r^2(n))$; (ii) $|E| = \Theta(n^2 r^2(n))$. Consequently, we immediately see that the volume doubling property holds, since the RGG is evidently close to a 2D mesh in terms of volumes at the different radii except for a $\log(n)$ factor.

Constant Resistance Property: For any subsets $A \subset B \subset V$ consider an electrical network with a one-ohm resistor for each edge of the graph $G = (V, E)$. Define the resistance $\rho(A, B)$ between $A$ and $B$ as the reciprocal of the power dissipated when a one-volt potential is applied to all the nodes in $A$ and the nodes in the complement of $B$, i.e., $B^c$, are all grounded. The graph $G = (V, E)$ is said to satisfy the constant resistance property if

$$C_6 \le \rho(B(u, R), B(u, MR)) \le C_7, \qquad (25)$$

where $M$ is any number larger than one and $C_6$ and $C_7$ are constants that can depend on $M$ but not on $R$.

Again consider first the example of a 2D mesh. Due to symmetry all the nodes at distance $R + \Delta$ have the same potential. Consequently we can short all the nodes at this distance. Due to the geometric neighborhood property, there are about $R + \Delta$ nodes connected to nodes at distance $R + \Delta - 1$. Since this is a parallel set of resistances the effective resistance is $1/(R + \Delta)$. Summing over these resistances in series we obtain

$$\rho(B(u, R), B(u, MR)) = \sum_{\Delta = 1}^{(M-1)R} \frac{1}{R + \Delta} \approx \log\left(\frac{MR}{R}\right) = \log(M),$$

which establishes the fact. A similar but more elaborate argument is required for the RGG. Basically the short-cut principle along with the geo-dense property ensures a lower bound of $\Omega(\log(M))$. To obtain an upper bound we need to construct a flow along the lines of [4] that satisfies the Kirchhoff current law.
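The approximation of the series sum of shell resistances by $\log(M)$ is easy to verify numerically. The sketch below assumes integer $R$ and $M$ and simply adds the terms $1/(R + \Delta)$; the function name is hypothetical.

```python
import math


def shell_resistance(R, M):
    """Sum of 1 / (R + delta) for delta = 1, ..., (M - 1) * R."""
    return sum(1.0 / (R + delta) for delta in range(1, (M - 1) * R + 1))


def gap_to_log(R, M):
    """Difference log(M) - shell_resistance(R, M); positive and below 1 / R."""
    return math.log(M) - shell_resistance(R, M)
```

Comparing the sum with the integral of $1/x$ shows that the gap is always positive and smaller than $1/R$, so the sum is within a constant of $\log(M)$ uniformly in $R$, which is the content of the constant resistance property for this example.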

Uniform Isoperimetry Property: We consider the subgraph $G(u, R)$ formed by restricting the graph $G = (V, E)$ to the subset of vertices in the ball $B(u, R)$. Consider any partition of $G(u, R)$ into $S, S^c$. We say that the graph $G = (V, E)$ satisfies a uniform isoperimetry property if for every $u$ and every $R$ we have

$$\frac{C_8}{R} \le \frac{Cut(S, S^c)}{\min(Vol(S), Vol(S^c))}, \qquad (26)$$

where $C_8$ is some constant independent of $R$ and $Cut(S, S^c) = \sum_{u \in S,\, v \in S^c} \mathbf{1}_{(u,v) \in E}$ is the number of edges between $S$ and $S^c$.

For the 2D mesh this is a well-known property (see [10]). The corresponding property for an RGG is a direct consequence of Theorem 4.1 of [4].

We are ready to state our result. 

Lemma 6.2 Consider a graph $G = (V, E)$ that is in general infinite and satisfies the properties described in Equations 24, 25 and 26. Then the t-step transition probability satisfies the Gaussian estimate described in Equation 22.

Proof. The proof is a direct consequence of the results in Merkov [17] and Grigoryan and Telcs [13]. Theorem 3.1 in Grigoryan and Telcs [13] states that if a graph $G = (V, E)$ satisfies the volume doubling property, the resistance property and the Elliptic Harnack Inequality, then the t-step transition matrix satisfies Equation 22. Merkov [17] shows that the isoperimetry property implies the Elliptic Harnack Inequality. □



6.3 Message and Time Complexity for Achieving Consensus 

We utilize Theorem 5.1 to characterize message and time complexity for achieving consensus in general graphs. For general graphs we note from the resistance arguments of Equation 15 that

$$T_\gamma \le 2n \log(n/\gamma) D_{\max} \rho^* \implies T \le T_\gamma + O(\mathrm{diam}(G)) \le 2n \log(n/\gamma) D_{\max} \rho^* + O(n).$$

As we described earlier this bound is not useful for characterizing message complexity. To obtain better bounds we restrict our attention to graphs satisfying the volume doubling, constant resistance and uniform isoperimetry properties described in the previous section. We note that the message complexity from Theorem 5.1 can be bounded as

$$M(T_\gamma) \le M_c(T_\gamma) + 2\gamma \sum_{v \in V} d_v \le O\left(\frac{n}{\gamma} \log^2(n)\right) + 2\gamma \sum_{v \in V} d_v,$$

where $d_v$ is the degree of node $v$ and we have used Equation 18 to determine a bound on $M_c(T_\gamma)$. We now let $\gamma = \log(n)$. It follows that the message complexity for the two-phase scheme is

$$M(T_\gamma) \le O(n \log(n)) + 2\gamma \sum_{v \in V} d_v \implies M(T_\gamma) \le O(n \log(n)),$$

where in the final inequality we have used the fact that $d_v \le 4$ for two-dimensional grid graphs. The time complexity for grid graphs follows from Equation 6,

$$T \le T_\gamma + O(\mathrm{diam}(G)) \le O(n),$$

where we note that $T_\gamma$ for $\gamma = O(\log(n))$ scales as $O(n)$ and $O(\mathrm{diam}(G))$ scales as $O(\sqrt{n})$.

For the RGG we note that with high probability the number of links per node is $O(\log(n))$. Consequently, following the same lines as the previous computation for 2D grid graphs we obtain

$$M(T_\gamma) \le O(n \log^2(n)).$$

By noting that for the RGG $\mathrm{diam}(G) = O(\sqrt{n/\log(n)})$ we get a bound on the time complexity:

$$T \le O(n \log(n/\gamma)) + O(\mathrm{diam}(G)) \le O(n).$$

6.4 Numerical Results 

Numerical verification of the analytical results of Table 1 on a 2-d torus is presented in Figure 4, which summarizes numerical simulations for time and message complexities of CRW on 2-dimensional tori of varying sizes.

An important consideration is that GOSSIP achieves consensus at all nodes while SRW and CRW realize their solution at a random node. Therefore, strictly speaking, for the comparisons to be meaningful we need to add the time and message complexities required to obtain similar consensus estimates for SRW and CRW. We can obtain consensus through CFLD. The time complexity of CFLD for tori scales as $O(\sqrt{n})$, which is insignificant relative to the time complexity of SRW/CRW. The message complexity per node of CFLD on tori scales as $O(\log(n))$, which is again insignificant relative to the $O(\log^2(n))$ message complexity of CRW. Consequently, the qualitative nature of the plots is similar even when we incorporate these additional costs.

We also simulate numerically the time and message complexity for random geometric graphs. To simulate a 2-dimensional random geometric graph we distributed $n$ nodes in a unit square and formed edges whenever two nodes were at a distance smaller than $\sqrt{2 \log n / n}$. We discarded graphs that were not connected. Again, to compare CRW/SRW against GOSSIP, consensus costs must be incorporated. We can obtain consensus through CFLD. The time complexity of CFLD for RGG scales as $O(\sqrt{n/\log(n)})$, which is insignificant relative to the time complexity of SRW/CRW. The message complexity per node of CFLD on RGG scales as $O(\log(n))$, which is again insignificant relative to the $O(\log^2(n))$ message complexity of CRW. Consequently, the qualitative nature of the plots is similar even when we incorporate these additional costs.
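The graph-generation step of this experiment can be sketched as follows. The code is not the authors' simulation code: it is a minimal reconstruction of the stated procedure (uniform points in the unit square, connection radius $\sqrt{2 \log n / n}$, rejection of disconnected samples), and the function names and the `max_tries` cap are assumptions.

```python
import math
import random
from collections import deque


def random_geometric_graph(n, rng):
    """Sample n uniform points in the unit square and connect close pairs."""
    radius = math.sqrt(2.0 * math.log(n) / n)
    points = [(rng.random(), rng.random()) for _ in range(n)]
    adj = {i: [] for i in range(n)}
    for i in range(n):
        for j in range(i + 1, n):
            dx = points[i][0] - points[j][0]
            dy = points[i][1] - points[j][1]
            if math.hypot(dx, dy) <= radius:
                adj[i].append(j)
                adj[j].append(i)
    return adj


def is_connected(adj):
    """Breadth-first search from an arbitrary node reaches every node."""
    start = next(iter(adj))
    seen = {start}
    queue = deque([start])
    while queue:
        u = queue.popleft()
        for w in adj[u]:
            if w not in seen:
                seen.add(w)
                queue.append(w)
    return len(seen) == len(adj)


def sample_connected_rgg(n, rng, max_tries=1000):
    """Resample until a connected graph is obtained (rejection sampling)."""
    for _ in range(max_tries):
        adj = random_geometric_graph(n, rng)
        if is_connected(adj):
            return adj
    raise RuntimeError("no connected sample found")
```

The quadratic pairwise distance check is adequate for the graph sizes typical of such simulations; a spatial grid would be the usual refinement for larger $n$.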

We next describe the Gossip algorithm as studied in [7] for the sake of completeness. Gossip algorithms refer to distributed randomized algorithms that are based on pairwise relaxations between randomly chosen node pairs. In the present context a pairwise relaxation refers to averaging of two values available at distinct nodes. In what follows a stochastic matrix $P = [P_{ij}]_{n \times n}$ is called admissible for $G$ if $P_{ij} = 0$ unless nodes $i$ and $j$ are neighbors in $G$. The algorithm is parameterized by such a $P$:

Algorithm GOSSIP-AVE(P): Each node $i$ maintains a real valued variable with initial value $z_i(0) = x_i$. At the tick of a local Poisson clock, say at time $t_0$, node $i$ chooses a neighbor $j$ with respect to the distribution $(P_{ij} : j = 1, 2, \ldots, n)$ and both nodes update their internal variables as $z_i(t_0) = z_j(t_0) = (z_i(t_0^-) + z_j(t_0^-))/2$.

We associate each node with a real value and consider the problem of computing its mean value. In order to make a fair comparison of GOSSIP-AVE with CRW and SRW we need to use a stopping criterion for GOSSIP-AVE. Let $\bar{x}$ denote the average of $x_1, x_2, \ldots, x_n$, let $z(t)$ denote the vector $(z_1(t), z_2(t), \ldots, z_n(t))$ of node values at time $t$, and let $\mathbf{1}$ denote the vector of all 1s. Define $\tau_k$ as the kth time instant such that some local clock ticks and thereby triggers messaging in the network. For $\varepsilon > 0$ let the deterministic quantity $K(\varepsilon, P)$ be defined by

$$K(\varepsilon, P) = \sup_{z(0)} \inf\left\{k : \Pr\left(\frac{\|z(\tau_k) - \bar{x}\mathbf{1}\|_2}{\|z(0)\|_2} \ge \varepsilon\right) \le \varepsilon\right\}.$$

In [7] $K(\varepsilon, P)$ is considered as a termination time for Algorithm GOSSIP-AVE(P) and minimization of $K(\varepsilon, P)$ is sought by proper choice of $P$. Here we adopt the same interpretation for comparison purposes. It should perhaps be noted here that this is a fairly weak stopping criterion, as $\|z(\tau_{K(\varepsilon, P)}) - \bar{x}\mathbf{1}\|_\infty / |\bar{x}|$ may be much larger than $\varepsilon$.

Figure 4: Average execution times and message complexities per node for CRW and GOSSIP on the 2-dimensional torus with $n$ nodes. Note that for an accurate comparison the time and message complexity of CFLD needs to be added to that of CRW. Nevertheless, both the time and message complexity of CFLD are insignificant relative to those of CRW (see Section 6.3) for the torus.

The numerical results for the average number of messages and run times on the 2-D torus appear in Figure 4. 
The corresponding results for geometric random graphs, together with an illustrative comparison with Gossip, 
are presented in Figure 5. We have also plotted the bound $2\log(n)$ for comparison purposes. From 
the scale of the two plots it should be clear that the bound has a similar qualitative 
relationship to the message complexity for the torus. These bounds reveal that the empirical 
per-node message complexity appears to be closer to $O(\log(n))$, which is much smaller than the 
$O(\log^2(n))$ theoretical message complexity bound of Equation 18. One possible reason for this difference 
is that our theoretical message complexity bound is for a worst-case distribution of initial node values, 
Figure 5: Average run times and number of messages per node on random geometric graphs (RGG). 
The solid curve represents simulation results for CRW, the dashed curve those for SRW, and the dotted curve 
represents lower bounds for GOSSIP-AVE(P) based on a lower bound for $K(\epsilon, P)$. Note also that the exact value 
of $F_n(x_1, x_2, \ldots, x_n)$ is obtained at the termination of CRW or SRW, whereas no such claim can be made for 
GOSSIP-AVE(P). Note that, for an accurate comparison, the time and message complexity of CFLD need to be added 
to those of CRW. Nevertheless, for RGG both are insignificant relative to those 
of CRW (see Section 6.3). 



while the empirical result is for an average case distribution of the node values. 



7 Appendix 

7.1 Proof of Theorem 6.1 

We follow the argument of [8] and provide a detailed proof to point out that the proof goes through 
for general graphs. The proof applies to both discrete and continuous settings and basically utilizes 
Markovianity. Let $(A_B(t) : t \ge 0)$, $B \subset V$, denote the occupied nodes (state) at time $t$ of a coalescing 
random walk whose initial state is $B$. Observe that, irrespective of the initial state $B$, if $0 \le s \le t$, 

$$E(|A_B(t)|) \le E(|A_B(s)|).$$

We then have the following lemma: 

Lemma 7.1 Suppose $(X_t \mid t \ge 0)$ is a simple symmetric unit rate continuous time random walk 
on graph $\Gamma$ and $B \subset A \subset V$, then 

$$E[|A_B(s)|] \le |B| - (|B| - 1)\,\alpha_s(A). \tag{27}$$

Proof. If $B = \emptyset$, Equation (27) trivially holds; therefore assume $B$ is non-empty. Our approach 
is to find a lower bound for the number of coalescences occurring in the time interval $[0, s]$. Fix a node 
$w \in B$. Our analysis begins by starting random walks with active tokens at each $v \in B \setminus \{w\}$ and at $w$, and checking 
whether their paths ever meet. To formalize this approach, define an indicator function $\chi(\cdot)$ to 
indicate whether or not the active tokens started at $v$ and $w$ meet by time $s$. Then, 

$$Z(s) = \sum_{v \in B \setminus \{w\}} \chi(C_{v,w} \le s) \tag{28}$$

The random quantity $Z(s)$ is the number of active tokens of $B \setminus \{w\}$ which coalesce with $w$ at some time 
$0 \le t \le s$. It follows through conservation of active tokens that 

$$|A_B(s)| \le |B| - Z(s).$$

Now, 

$$E(Z(s)) = \sum_{v \in B \setminus \{w\}} \mathrm{Prob}(C_{v,w} \le s) \ge \min_{v,w \in B} \mathrm{Prob}(C_{v,w} \le s)\,(|B| - 1) \ge \alpha_s(A)\,(|B| - 1),$$

where the last step uses $B \subset A$. The result now follows by taking expectations on both sides of 
$|A_B(s)| \le |B| - Z(s)$, with $Z(s)$ as in Equation 28, and substituting the above expression. □ 
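
The monotonicity used in this argument (coalescence can only reduce the number of active tokens) is easy to observe in simulation. The sketch below is an assumption-laden illustration, not the authors' simulator: it replaces the continuous-time unit-rate walks by an asynchronous discrete version in which one uniformly chosen token moves per step, and the function names `torus_neighbors` and `coalescing_walk` are hypothetical.

```python
import random

def torus_neighbors(side):
    """Adjacency lists of a side x side 2-D torus, nodes numbered row-major."""
    nbrs = []
    for r in range(side):
        for c in range(side):
            nbrs.append([
                ((r - 1) % side) * side + c,
                ((r + 1) % side) * side + c,
                r * side + (c - 1) % side,
                r * side + (c + 1) % side,
            ])
    return nbrs

def coalescing_walk(neighbors, start, moves, seed=0):
    """Token counts after each move of a coalescing random walk started from `start`."""
    rng = random.Random(seed)
    occupied = set(start)                  # nodes currently holding an active token
    counts = [len(occupied)]
    for _ in range(moves):
        v = rng.choice(sorted(occupied))   # token whose clock ticks (uniform over tokens)
        w = rng.choice(neighbors[v])       # it moves to a uniformly chosen neighbour
        occupied.discard(v)
        occupied.add(w)                    # landing on an occupied node merges the tokens
        counts.append(len(occupied))
    return counts

# Example: every node of a 6 x 6 torus starts with a token.
counts = coalescing_walk(torus_neighbors(6), range(36), moves=2000)
```

Starting the same routine from a subset `B` of the nodes gives sample paths of $|A_B(\cdot)|$, which can be averaged over seeds to estimate the expectation bounded in Lemma 7.1.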

Now consider the partition $A_i$, $i = 1, 2, \ldots, m(t)$, of the vertices of the graph as in the hypothesis 
of the theorem and let $B_j = A_j \cap B$. 

Lemma 7.2 $|A_B(s)| \le \sum_{j=1}^{m(t)} |A_{B_j}(s)|$ for all $s \ge 0$. 

Proof. Let $C_{ij}(s)$ be the number of active tokens starting in $A_i$ and coalescing with active tokens 
starting in $A_j$. Then, 

$$|A_B(s)| = \sum_{j=1}^{m(t)} |B_j| - \sum_{i=1}^{m(t)} \sum_{j=1}^{m(t)} C_{ij}(s) \le \sum_{j=1}^{m(t)} \big(|B_j| - C_{jj}(s)\big) = \sum_{j=1}^{m(t)} |A_{B_j}(s)| \tag{29}$$

□ 

Now using the Markov property we can upper bound the number of active tokens at any time as 
follows. In the beginning all the nodes of the graph $G = (V, E)$ are active. Hence we need to 
analyze $N(t) = E[|A_V(t)|]$. Let $O_j \subset V$ be an arbitrary subset of $V$. Since $V$ is finite, the 
collection of all subsets, $\{O_j, j \in \mathcal{I}\}$, can be indexed by a finite index set $\mathcal{I}$. Denote $O_{ij} = A_i \cap O_j$. 
It follows that, 

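$$\begin{aligned}
N(t + s) &= E[|A_V(t + s)|] \\
&= E\big[E[|A_V(t + s)| \mid A_V(t)]\big] = \sum_{j \in \mathcal{I}} \mathrm{Prob}(A_V(t) = O_j)\, E[|A_V(t + s)| \mid A_V(t) = O_j] \\
&\overset{(a)}{=} \sum_{j \in \mathcal{I}} \mathrm{Prob}(A_V(t) = O_j)\, E[|A_{O_j}(s)|] \\
&\overset{(b)}{\le} \sum_{j \in \mathcal{I}} \mathrm{Prob}(A_V(t) = O_j)\, E\Big[\sum_{i=1}^{m(t)} |A_{O_j \cap A_i}(s)|\Big] = \sum_{j \in \mathcal{I}} \mathrm{Prob}(A_V(t) = O_j) \sum_{i=1}^{m(t)} E[|A_{O_{ij}}(s)|]
\end{aligned} \tag{30}$$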
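where (a) follows from Markovianity and (b) follows from Lemma 7.2. Next, since for all $i, j$ we have 
$O_{ij} \subset A_i \subset V$, we can apply Lemma 7.1 and obtain: 

$$\begin{aligned}
\sum_{i=1}^{m(t)} E[|A_{O_{ij}}(s)|] &\le \sum_{i=1}^{m(t)} \big(|O_{ij}| - (|O_{ij}| - 1)\,\alpha_s(A_t)\big) \\
&= (1 - \alpha_s(A_t)) \sum_{i=1}^{m(t)} |O_{ij}| + m(t)\,\alpha_s(A_t) = (1 - \alpha_s(A_t))\,|O_j| + m(t)\,\alpha_s(A_t).
\end{aligned}$$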

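Substituting this result in the inequality (30) we obtain 

$$\begin{aligned}
N(t + s) = E[|A_V(t + s)|] &\le (1 - \alpha_s(A_t)) \sum_{j \in \mathcal{I}} \mathrm{Prob}(A_V(t) = O_j)\,|O_j| + m(t)\,\alpha_s(A_t) \\
&= (1 - \alpha_s(A_t))\,E[|A_V(t)|] + m(t)\,\alpha_s(A_t) \\
&\overset{(a)}{\le} \Big(1 - \frac{\alpha_s(A_t)}{2}\Big) E[|A_V(t)|] \le \exp\Big(-\frac{\alpha_s(A_t)}{2}\Big) N(t)
\end{aligned} \tag{31}$$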

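where (a) follows from the choice of the number of partitions, which is one half of the number of 
active tokens at time $t$. To prove Equation 13 we note from Equation 31 that for any $r$ and $s$ such 
that $t \le r \le r + s \le 2t$ we have, 

$$\begin{aligned}
N(r + s) &\le (1 - \alpha_s(A_t))\,N(r) + m(t)\,\alpha_s(A_t) \le (1 - \alpha_s(A_t))\,N(r) + \frac{\alpha_s(A_t)}{2} N(2t) \\
&\le (1 - \alpha_s(A_t))\,N(r) + \frac{\alpha_s(A_t)}{2} N(r) \le \Big(1 - \frac{\alpha_s(A_t)}{2}\Big) N(r) \le \exp\Big(-\frac{\alpha_s(A_t)}{2}\Big) N(r)
\end{aligned}$$

where the third inequality follows from the fact that, since $r \le 2t$, we have $N(2t) \le N(r)$. Now, 
iterating over successive steps of length $s$, Equation 13 follows. 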
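
To spell out the iteration, the following sketch assumes the one-step contraction $N(r + s) \le \exp(-\alpha_s(A_t)/2)\,N(r)$ for all $t \le r \le r + s \le 2t$, and writes $k = \lfloor t/s \rfloor$, a symbol introduced here only for illustration. The exact constants of Equation 13 are not reproduced.

```latex
\begin{align*}
N(t + ks) &\le \exp\!\Big(-\tfrac{\alpha_s(A_t)}{2}\Big)\, N\big(t + (k-1)s\big)
  \le \cdots \le \exp\!\Big(-\tfrac{k\,\alpha_s(A_t)}{2}\Big)\, N(t), \\
N(2t) &\le N(t + ks)
  \quad \text{since } t + ks \le 2t \text{ and } N(\cdot) \text{ is non-increasing.}
\end{align*}
```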



8 Proof of Theorem 6.3 



We first consider the case where 

$$2 \le \frac{N(t)}{4} \le \frac{N(2t)}{2} \tag{32}$$

This implies that $N(t) \ge 8$ and $2N(2t) \ge N(t)$. If these assumptions are violated then we are in 
the case where either $N(t) < 8$ or 

$$N(2t) < \frac{1}{2} N(t).$$

First, consider the situation when Equation 32 is satisfied. We will choose the collection $A_t$ and 
the time step $s$ so that the assumptions underlying Equation 13 are satisfied. Specifically, we let $A_t$ be 
the collection of balls $B(u, R_t)$ of radius $R_t$ for suitable vertices $u \in V$ to cover the graph $G$. We 
select the radius $R_t$ and the time step $s$ as follows: 

$$R_t = \sqrt{\frac{8n}{C_0 N(t)}}, \qquad s = R_t^2,$$

where $C_0$ is the constant satisfying Equation 19. We can assume that $s \le t/2$. This is because if 
this condition is violated then we have 

$$N(t) \le \frac{128\,n}{C_0\, t} \tag{33}$$

which satisfies the condition of the Theorem and there is nothing to prove. 
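So we suppose $s \le t/2$. The number of partitions satisfies 

$$m(t) \le \frac{n}{C_0 R_t^2} + 1 \le \frac{N(t)}{8} + 1 \le \frac{N(t)}{4}.$$

This ensures that the assumptions underlying Equation 13 are satisfied. Consequently, we get 

$$N(2t) \le N(t) \exp\left(-\frac{t}{2R_t^2}\,\frac{C_2}{\log(R_t)}\right), \quad t \ge 2.$$

Denoting 

$$f_t = \frac{N(t)\,t}{n \log(t)}$$

and substituting for 

$$R_t^2 = \frac{8n}{C_0 N(t)} = \frac{8t}{C_0 f_t \log(t)}$$

we get, 

$$\begin{aligned}
f_{2t} &\le f_t\,\frac{2\log(t)}{\log(2t)} \exp\left(-\frac{t}{2R_t^2}\,\frac{C_2}{\log(R_t)}\right) \\
&\le f_t \exp\left(\log(2) - \frac{C_0 f_t \log(t)}{16}\,\frac{C_2}{\log(R_t)}\right) \\
&\le f_t \exp\left(\log(2) - \frac{C_0 C_2 f_t \log(t)}{8 \log\big(8t / (C_0 f_t \log(t))\big)}\right) \\
&\le f_t \exp\left(\log(2) - \frac{C_0 C_2}{8}\, f_t\, \frac{\log(t)}{\log(8/C_0) - \log(f_t) + \log(t/\log(t))}\right)
\end{aligned}$$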

where the second inequality follows from the fact that $\log(t)/\log(2t) \le 1$. Now we note that 
$f_t \le t/\log(t)$. Consequently, if $8/C_0 \le f_t$ then 

$$\frac{\log(t)}{\log(8/C_0) - \log(f_t) + \log(t/\log(t))} \ge 1.$$



Also, if simultaneously $f_t \ge \frac{8\log(2)}{C_0 C_2}$, we get $f_{2t} \le f_t$. On the other hand, if any of these conditions 
is violated we get 

$$f_{2t} \le 2\max\left(\frac{8}{C_0}, \frac{8\log(2)}{C_0 C_2}\right) = C.$$



This implies that 

$$f_{2t} \le \max(C, f_t) \implies N(2t) \le \max\left(C\,\frac{n\log(2t)}{2t},\ \frac{N(t)}{2}\Big(1 + \frac{1}{\log(t)}\Big)\right).$$



Finally, we have two cases left to consider: (1) if Equation 32 itself is violated we have $N(2t) \le \frac{1}{2}N(t)$; 
(2) Equation 33 holds. In all of these cases Eq. 21 is satisfied and the result follows. 
8.1 Proof of Lemma 6.1 



Proof. The proof follows directly along the lines of Pettarin et al. [18] (Lemma 9). We provide a 
brief sketch of their proof here for the sake of completeness. First, let $P_t(x, y)$ denote the probability 
that a walk starting at node $x$ at time zero is at node $y$ at time $t$. Now consider two walks, one 
starting at node $u$ and another starting at node $w$ at time zero. Note that the two walks are 
independent and they have their own corresponding transition probabilities. Let $N(u, w, T_0)$ be 
the mean number of times the two walks starting at $u$ and $w$ meet in the time interval $[0, T_0]$. Then, 
noting that the two walks could meet at any node $v$ at any time $t \in [0, T_0]$, we obtain 

$$N(u, w, T_0) = \sum_{t=0}^{T_0} \sum_{v} P_t(u, v)\,P_t(w, v).$$

This is because $P_t(u, v)P_t(w, v)$ is the probability that both walks, starting at $u$ and $w$ respectively, 
are at the same node $v$ at time $t$. Summing over the different possibilities leads to the above result. 
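
For a small graph the mean number of meetings can be evaluated directly from this double sum. The sketch below assumes discrete-time simple random walks on a graph given by adjacency lists; the helper names `transition_matrix` and `expected_meetings` are hypothetical.

```python
def transition_matrix(neighbors):
    """Row-stochastic matrix of a simple random walk on the given adjacency lists."""
    n = len(neighbors)
    P = [[0.0] * n for _ in range(n)]
    for x, nbrs in enumerate(neighbors):
        for y in nbrs:
            P[x][y] += 1.0 / len(nbrs)
    return P

def expected_meetings(P, u, w, T0):
    """Sum over t = 0..T0 and over nodes v of P_t(u, v) * P_t(w, v)."""
    n = len(P)
    pu = [1.0 if v == u else 0.0 for v in range(n)]   # law at time t of the walk from u
    pw = [1.0 if v == w else 0.0 for v in range(n)]   # law at time t of the walk from w
    total = 0.0
    for _ in range(T0 + 1):
        total += sum(a * b for a, b in zip(pu, pw))   # probability both walks share a node
        pu = [sum(pu[x] * P[x][y] for x in range(n)) for y in range(n)]
        pw = [sum(pw[x] * P[x][y] for x in range(n)) for y in range(n)]
    return total

# Example: a cycle with 5 nodes.
cycle = [[(i - 1) % 5, (i + 1) % 5] for i in range(5)]
P = transition_matrix(cycle)
```

For two walks started at the same node of the cycle, the sum up to $T_0 = 1$ is $1 + 2 \cdot (1/2)^2 = 1.5$: the walks coincide at time 0 and agree at time 1 with probability $1/2$.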
Using this fact Pettarin et al. establish that 

$$\alpha_{R^2}(B(u, R)) \ge \frac{N(u, w, R^2)}{\max_{v \in B(u, R)} N(v, v, R^2)}.$$

Here, $N(v, v, R^2)$ is the mean number of times two walks starting at the same node $v$ meet again in 
the time interval $[0, R^2]$. The problem now boils down to lower bounding $N(u, w, R^2)$ and upper 
bounding $N(v, v, R^2)$. We are now ready to substitute the Gaussian $t$-step bounds to establish the 
result. Specifically, let 

$$D = \{v \in V \mid d(v, u) \le 2R,\ d(v, w) \le 2R\}.$$

We also note that since $B(u, R) \subset D$ and the graph satisfies the geometric neighborhood property, 
we have $|D| \ge C_0 R^2$. So 

$$N(u, w, R^2) = \sum_{t=0}^{R^2} \sum_{v} P_t(u, v)\,P_t(w, v) \ge \sum_{t=R^2/2+1}^{R^2} \sum_{v \in D} P_t(u, v)\,P_t(w, v).$$

By bounding $d^2(v, u)$ and $d^2(v, w)$ with $4R^2$ we obtain $N(u, w, R^2) = \Omega(1)$. Next we use the fact 
that there are no more than $C_1 k \Delta$ nodes in any annulus of size $\Delta$ at distance $k$ to obtain an 
upper bound for $N(v, v, R^2)$. Specifically, by taking an annulus of size one, our geometric condition 
implies that there are no more than $C_1 R$ nodes at distance $R$. So, 

$$N(v, v, T) = \sum_{t=0}^{T} \sum_{x} P_t(v, x)\,P_t(v, x) \le 1 + \sum_{t=1}^{T} \sum_{k=1}^{t} \sum_{d(v, x) = k} P_t(v, x)\,P_t(v, x)$$

/r',.\2 /^2A;2 



t 
The computation of the above sum follows along the same lines as in Pettarin et al. (Lemma 9). 
It follows that $N(v, v, T) = O(\log(T))$, which is $O(\log(R))$ for $T = R^2$. Consequently, there is a 
constant $C_2$ such that, 

$$\alpha_{R^2}(B(u, R)) \ge \frac{N(u, w, R^2)}{\max_{v \in B(u, R)} N(v, v, R^2)} \ge \frac{C_2}{\log(R)}.$$

□ 